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Top quarks are produced in pairs through the strong in- 
teraction and each top quark decays dominantly to a W 
boson and a b quark, followed by the W decaying ei- 
ther to a pair of quarks (which form jets) or a lepton 
and a neutrino. This paper describes a measurement of 
the top-antitop pair production cross section, au, in the 
pp ^ tt ^ ivqq'bb channel at a center-of-mass energy, 
^/s = 1.96 TeV using a new methodology to constrain 
background contributions and systematic effects, result- 
ing in an improved sensitivity. 

The jets which originate from the bottom quarks in 
the final state provide an opportunity to select events 
which arc more likely to have come from top quark decays 
than from other processes. A 6-tagging algorithm takes 
advantage of the characteristics — largely the secondary 
vertex displaced from the primary vertex — that distin- 
guish heavy flavor (HF) jets from charm and light flavor 
(LF) jets [3[. This algorithm allows us to reduce the 
backgrounds from W -\- jets processes, which can mimic 
the top decay signature, and was the basis for several 
previous measurements of the top cross section 0, 0] ■ 

While requiring the event to have at least one 6-tagged 
jet reduces the backgrounds, it does not eliminate them. 
It is important to estimate the amount of W boson pro- 
duction with associated jets from heavy flavor, which is 
theoretically difficult and a source of systematic uncer- 
tainties for measurements of the cross section as well as 
the mass. Here we reduce this systematic uncertainty 
and constrain the VF-I-HF background by performing a 
fit to the data which includes regions dominated by W 
+ jets. 

Since the ^-tagging algorithm can incorrectly tag light 
flavor jets as b jets, it is advantageous to apply an ad- 
ditional discriminant to ^-tagged jets to further separate 
processes with jets from bottom, charm, and light flavor. 
This flavor separator is a neural network whose output, 
on a statistical basis, discriminates between &-quark, c- 
quark, and light-flavor jets. The flavor separator uses 25 
variables to output a single number indicating how likely 
a jet is a & jet, where the invariant mass of the secondary 
vertex has the most separation power. The flavor sepa- 
rator was calibrated using data control samples [1] . 

In this paper, we use a flavor separator for the first 
time in the measurement of the tt cross-section. In order 
to constrain the background contributions, we perform 
the fit to the fiavor discriminant in nine samples defined 
by the number of jets, Ujet (1, 2, 3, 4, or > 5), and 
number of 6-tagged jets, ntag (1 or > 2). Events with one 
or two jets are dominated by -I- jets, whereas events 
with three or more jets are largely ti. Events with two b 
tags are dominated by Wbb and tt, whereas events with 
a single b tag are predominantly M^-|-charm and W^-|-LF. 
Previous methods selected events with three or more jets 
in order to reduce the largest background from W + jets 
processes 0, HI, • This new method instead constrains 
the background contribution of the W -f jets processes 



in the region with three or more jets by measuring the 
contributions in the regions with one and two jets. 

We use a data sample corresponding to 2.7 fb^^ of inte- 
grated luminosity, collected from February 2002 through 
April 2008 using the CDF II detector [8[, an approxi- 
mately cylindrically symmetric detector located at the 
Tevatron collider. CDF II is a general-purpose device; 
the central drift chamber provides charged-particle track- 
ing, while the silicon system provides excellent vertex 
and impact parameter resolution, both of which are im- 
portant for identifying bottom quarks. Electromagnetic 
and hadronic calorimeters are located outside the track- 
ing chambers, and provide identification of electrons and 
jets. At the outermost layer of the detector sit the muon 
drift chambers which provide muon identification. 

We select events with a W candidate decaying lep- 
tonically to either an electron or muon. We require 
at least one jet and exactly one lepton candidate both 
with transverse energy Et > 20 GcV and pseudorapid- 
ity \r]\ < 2.0 0]. We require that at least one jet is b- 
tagged, and that there is at least 20 GeV of missing trans- 
verse energy, fx, in the event. To reduce QCD back- 
grounds, we require the transverse mass of the W, = 

y^2(p^p^ -p^p^), to be at least 10 GcV/c^ for 

muons, and at least 20 GeV/c^ for electrons. Electron 
samples have a larger QCD background contamination 
than muon samples, so there we also require the If^T to 
satisfy > -0.05 (iuGeV/c^) + 3.5, where 

the denominator is the square root of the amount of un- 
clustered energy in the direction of the missing transverse 
energy [l3|. 

In addition to QCD multi-jet processes, the final state 
in this analysis can be mimicked by several other pro- 
cesses. W + jets processes are by far the largest source 
of backgrounds. Single top production, di-boson produc- 
tion, and Z + jets processes — collectively referred to as 
electroweak (EW) processes — also contribute. All but 
the QCD multi-jet backgrounds are modeled with Monte 
Carlo simulations; the QCD backgrounds are estimated 
using a data-driven approach. Events that pass the se- 
lection criteria, though with the lepton candidate failing 
any two identification cuts, arc mostly QCD multi-jet 
processes, and this sample is used to model the back- 
ground from these QCD processes. The normalization 
of the QCD background is estimated from a fit to the 
missing Et distribution for each sub-sample, without the 
missing Et requirement. 

Monte Carlo samples are employed to estimate accep- 
tances for the signal and backgrounds, and to model 
relevant distributions used in the fits described below. 
All of the Monte Carlo samples emp loyed were gener- 
ated using either PYTHIA v6.216 [11| (the ti and dibo- 
son samples), MADGRAPH [i2| (the single top sample), or 
ALPGEN v2.10' [l3| with generator-to-reconstructcd-jct 
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matching [IJ, [l5| and PYTHIA v6.326 for showering (the 
W + jets and Z + jets samples). The tt signal Monte 
Carlo sample was generated with the CTEQ5L [3l par- 
ton distribution functions (PDFs) assuming a top mass 
of mt = 175 GeV/c^. All samples are processed through 
a detailed simulation of the CDF II detector response, 
after which they are treated in the same manner as the 
data events. Each of the samples is divided based on the 
number of jets and b tags, and made into templates — 
binned distributions of the flavor separator output. 

The measurement is accomplished as a fit of the flavor 
separator distribution performed simultaneously in the 
nine data sub-samples. Results are obtained by max- 
imizing a binned Poisson likelihood which incorporates 
templates from each of the tt, W + jets, electroweak, 
and QCD processes. The templates are combined after 
initializing them to the predicted yield for data corre- 
sponding to an integrated luminosity of 2.7 fb^^, where 
the initialization factors are functions of cross sections, 
tagging efficiencies, and energy scales which are all pa- 
rameters in the fit. The overall normalization of each 
template is floated in the fit — a single overall normal- 
ization factor is used for each process — and the primary 
result of the fit is a set of those normalizations: the ti 
cross section and relative normalizations, Kp, to the stan- 
dard model expectations for W + jets, electroweak, and 
QCD components. 

Template normalizations also include functions, 
Pxih j, ^£c), that parametrize the effect of a source of sys- 
tematic uncertainty, x, in the sub-sample with ntag = i 
and Ujet = j, as a function of the relative shift, ^x, of 
quantity x, in units of the uncertainty on x. A sepa- 
rate function is employed for each process in each sub- 
sample for each source of systematic uncertainty; an ex- 
ample function is shown in Fig. [T] This leads to a to- 
tal of 12 parameters in the fit — seven normalizations of 
the samples (utt, K^^i,^, Kwce, Kwc, Kw+lf, Kew, 
and Kqcd), and five systematic uncertainty parameters 

/FSR, ^Q2, ^JES)- 

Systematic uncertainties in this measurement can af- 
fect both the normalizations and the shapes of the tem- 
plates. The rate uncertainties are naturally included in 
the fit via the Pxih j,^x) functions, and these system- 
atic uncertainties are refiected by the total fit error. To 
account for each shape uncertainty, we generate an ad- 
ditional set of templates with the variable in question 
changed, re-run the fit, and take the difference in the 
result as the uncertainty. 

We vary the ^-tagging efficiency, mistag rate, and 
the jet energy calibration [l3| by their uncertainties for 
all simulated samples. Initial- and final-state radiation 
(ISR/FSR) are processes in which gluons are radiated be- 
fore or after the collision, respectively. The uncertainty 
arises due to ISR/FSR leading to a larger or smaller num- 
ber of jets in the event. To account for this, we make 
additional sets of tt templates with more or less ISR and 
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FIG. 1. (Color) The function which parameterizes the 
effect of the jet energy scale on the 1-tag templates of 
the ti sample. Each jet and tag bin for each process in 
each subsample has a different function for each source 
of systematic uncertainty. The x axis is in units of the 
systematic shift. 



FSR as compared to the normal settings; the different 
settiiigs are constrained by studies of Drell-Yan produc- 



tion 
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The systematic uncertainty associated to the 
choice of the renormalization and factorization scales is 
estimated by varying these scales between half and twice 
their default values, as indicated in [l3|- This variation 
also accounts for differences in ISR/FSR in W + jets 
processes. 

The uncertainty due to the choice of the algorithm 
used to generate the parton shower was determined by 
comparing the results obtained using pythia and her- 
WIG ■ We account for uncertainties in our modeling of 
the QCD template shape by using electronlikc signals as- 
sociated with multiple tracks, rather than electrons that 
fail identification cuts, to make templates. The fiavor 
separator has a correction factor applied to match its 
mistag rate to the one observed in data; to account for 
this uncertainty, we examine templates without this fac- 
tor applied. The models describing color reconnection — 
i.e., the QCD cross-talk between the decay products of 
the top quarks — are not known precisely, so we account 
for this uncertainty by comparing two different models. 
We take an uncertainty of 0.6% on the top cross section 
due to the PDFs, and an uncertainty of 0.5% due to the 
beam position and lepton identification efficiency. We 
take a conservative 2% uncertainty due to the PDFs on 
the W + jets results. The measured luminosity has an 
uncertainty of 5.9%. 

The total normalizations of the tt and Wbb components 
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in the fit are given by 

Pstagii, j-,(,Btag) ' PMistag{i, j , £,Mistag)- 
PjEs{i,j,ijEs) 



MC 



Wbb ■ "wbb ■ ^ ■ ^'^w ■ ^Wbbi^'j)' 
Pstagih j: ^Btag) ' PMistagih j , £,Mistag , 
PjEs{i,j,ijEs) ■ PQ^{i,j,^Q^) 



(1) 



(2) 



where ati is the cross section and K^r^,l is the relative nor- 
mahzation factor; cr^^*^ is the cross section from Monte 
Carlo simulations; L — JCdt is the integrated luminosity; 
F;^''^^{i,j) is the Monte Carlo prediction for the fraction 
of events with i b tags and j jets, including reconstruc- 
tion and selection efficiencies; 3^^, is a factor of 1.54 
obtained from the ratio of the measured W + jets cross 
section to the ALPGEN-prediction cross section — this 
is necessary due to ALPGEN being a leading-order event 
generator; and Pxih j,^x) are functions for each source 
of systematic uncertainty, x. Normalizations of the other 
five samples are obtained in a similar manner. For refer- 
ence and easier comparison to other measurements, the 
cross sections for K = 1 arc calculated from ALPGEN as 
2744.1 pb for W+LF, 31.9 pb for Wc, 13.1 pb for Wcc, 
and 6.8 pb for Wbb. 

The data and best fit to the flavor separator distri- 
bution are shown in Fig. [51 The tt production cross 
section is found to be att ~ 7 
normalization factors are K 



7-64^o.54 pb, and relative 



0.28 

Wbb 

+0.90 _ 1 eo+0.34 _ r, qo+0.34 ly- 

1 10+°-^° 



0.83^g;7^, Kwc 



0.82;°;^^. These results in- 



and Kqcd 

elude statistical and systematic uncertainties, but do not 
include an uncertainty due to the luminosity. The top 
cross section we measure is consistent with theoretical 



predictions j2l| - |24j . and the values of the relative nor- 
malizations are consistent with what is expected from the 
theoretical uncertainty of the leading-order cross sections 
used by the Monte Carlo simulation generators [l3|, [14 1 . 

In order to evaluate the performance of this new 
method, we have compared the estimates of the system- 
atic uncertainties on the top cross section with the pre- 
vious method of background estimation [3], though as 
applied to the same integrated luminosity. A summary 
of these comparisons is shown in Table HI The total un- 
certainty drops from 0.84 pb to 0.73 pb, which is a 13% 
improvement. However, the previous result developed a 
normalization to the Z cross section to reduce the lu- 
minosity uncertainty dramatically, and this method can 
be extended in the future to include that improvement. 
Therefore, upon excluding the luminosity uncertainty in 
order to better compare the methods, the uncertainty 
drops from 0.72 pb to 0.57 pb for a 21% improvement. 

In summary, we measured the top pair production 
cross section using a novel method for estimating back- 
ground contributions with CDF II data corresponding to 
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FIG. 2. (Color) The data and best fit for the fiavor 
separator distribution for samples defined by the 
number of jets and number of tags. The legend is 
located in the "2 Jets 1 Tag" bin. Note that the flavor 
separator distribution is divided into fewer bins for 
samples with fewer events. For samples with two or 
more tags, we show the average of the flavor separator 
output from the two highcst-p^ tagged jets. 



TABLE I. Comparison of systematic uncertainties 
between this result and the previous method of 
background estimation f?\. 



Uncertainty 


Previous Method 


This Result 


Statistical 


0.36 pb 


0.33 pb 


HF K-Factor 


0.27 pb 


Inc in stat 




N/A 


0.21 pb 


B Tagging 


0.39 pb 


0.23 pb 


Mistags 


0.17 pb 


0.08 pb 


JES 


0.29 pb 


0.29 pb 


ISR/FSR 


0.06 pb 


0.01 pb 


Parton Showering 


0.21 pb 


0.11 pb 


QCD Shape 


0.06 pb 


0.01 pb 


Flavor Separator Correction 


N/A 


0.10 pb 


Color Reconnection 


N/A 


0.03 pb 


PDF 


0.04 pb 


0.05 pb 


Lepton ID / trigger 


0.04 pb 


0.05 pb 


ZO 


0.02 pb 


0.02 pb 


Sub-Total 


0.72 pb 


0.57 pb 


Luminosity 


0.43 pb 


0.45 pb 


Total 


0.84 pb 


0.73 pb 



an integrated luminosity of 2.7 fb~ . The cross section we 
measure, 7.64 ± 0.57 (stat -I- syst) ± 0.45 (luminosity), is 
consistent with the standard model next-to-leading order 
theoretical calculation [S^], and the background contri- 
butions are consistent with other predictions 0. Com- 
pared to the previous method of background estimation, 
using 6-tagging, this new method improves the precision 
on the top quark pair production cross section by 21%, 
excluding luminosity uncertainties. 
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